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Abstract 

This paper provides a systematic study of several recently suggested 
measures for online algorithms in the context of a specific problem, 
namely, the two server problem on three colinear points. We examine 
how these measures evaluate the Greedy Algorithm and Lazy Dou- 
ble Coverage, commonly studied algorithms in the context of server 
problems. We examine Competitive Analysis, the Max/Max Ratio, 
the Random Order Ratio, Bijective Analyis and Relative Worst Order 
Analysis and determine how they compare the two algorithms. We 
find that by the Max/Max Ratio and Bijective Analysis, Greedy is the 
better algorithm. Under the other measures Lazy Double Coverage is 
better, though Relative Worst Order Analysis indicates that Greedy 
is sometimes better. Our results also provide the first example of an 
algorithm that is optimal under Relative Worst Order Analysis. 

* Partially supported by the Danish Natural Science Research Council. Part of this 
work was carried out while these authors were visiting the University of California, Irvine. 
f Partially supported by NSF Grant CCR-0514082. 
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1 Introduction 



Since its introduction by Sleator and Tarjan in 1985 [2j, Competitive Anal- 
ysis has been the most widely used method for evaluating online algorithms. 
A problem is said to be online if the input to the problem is given a piece 
at a time, and the algorithm must commit to parts of the solution over time 
before the entire input is revealed to the algorithm. Competitive Analysis 
evaluates an online algorithm in comparison to the optimal offline algorithm 
which receives the input in its entirety in advance and has unlimited com- 
putational power in determining a solution. Informally speaking, we look 
at the worst-case input which maximizes the ratio of the cost of the on- 
line algorithm for that input to the cost of the optimal offline algorithm on 
that same input. The maximum ratio achieved is called the Competitive 
Ratio. Thus, we factor out the inherent difficulty of a particular input (for 
which the offline algorithm is penalized along with the online algorithm) and 
measure what is lost in making decisions with partial information. 

Despite the popularity of Competitive Analysis, researchers have been well 
aware of its deficiencies and have been seeking better alternatives almost 
since the time that it came into wide use. Many of the problems with 
Competitive Analysis stem from the fact that it is a worst case measure 
and fails to examine the performance of algorithms on instances that would 
be expected in a particular application. Other problems come from the 
fact that the ratios tend to be artificially high since online algorithms are 
compared against such a powerful alternative. It has also been observed 
that Competitive Analysis sometimes fails to distinguish between algorithms 
which have very different performance in practice and intuitively differ in 
quality. 

Over the years, researchers have devised alternatives to Competitive Anal- 
ysis, each designed to address one or all of its shortcomings. This paper is a 
study of several alternative measures for evaluating online algorithms that 
have been recently suggested in the literature. We perform this compari- 
son in the context of a particular problem: the 2-server problem on the line 
with three possible request points, nick-named here the baby server problem. 
We look at two algorithms (Greedy and Lazy Double Coverage [6]) 
and four different measures: Bijective Analysis, the Max/Max Ratio, Ran- 
dom Order Ratio and Relative Worst Order Analysis. Our results include 
the first example of an algorithm which is optimal for a particular problem 
under Relative Worst Order Analysis. 
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In the baby server problem, three distinct points on the line are determined 
in advance. There are two mobile servers, each of which occupies a single 
point on the line at any time. A sequence of requests arrives. Each request is 
a name of one of the three designated points on the line. After each request, 
one of the two servers must move to that location. The algorithm decides 
which server to send and the goal is to minimize the total distance traveled 
by the two servers. The Greedy algorithm always sends the server that is 
closest to the requested point. Lazy Double Coverage (to be defined 
more formally later) may move the server that is farther away if the closer 
server has traveled a certain amount since the farther server has last moved. 

In investigating this problem, we find that according to some quality mea- 
sures for online algorithms, Greedy is better than Ldc, whereas for others, 
Ldc is better than Greedy. 

The ones that conclude that Ldc is best are focused on a worst-case sequence 
for the ratio of an algorithm's cost compared to Opt. In the case of Greedy 
and Ldc, this conclusion makes use of the fact that there exists a family of 
sequences for which Greedy's cost is unboundedly larger than the cost of 
Opt, whereas Ldc's cost is always at most a factor two larger than the cost 
of Opt. 

On the other hand, the measures that conclude that Greedy is best compare 
two algorithms based on the multiset of costs stemming from the set of all 
sequences of a fixed length. In the case of Greedy and Ldc, this makes use 
of the fact that for any fixed n, both the maximum as well as the average 
cost of Ldc over all sequences of length n are greater than the corresponding 
values for Greedy. 

Using Relative Worst Order Analysis a more nuanced result can be obtained. 
We show that Ldc can be a factor at most two worse than Greedy, while 
Greedy can be unboundedly worse than Ldc. 

The simplicity of this problem enables us to give the first proof of optimality 
in Relative Worst Order Analysis: Ldc is an optimal algorithm for this case. 

2 Preliminaries 

In this section, we define the server problem used throughout this paper as 
the basis for our comparison. We also define the the server algorithms used, 
and the quality measures which are the subject of this study. 
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2.1 The Server Problem 



Server problems [3] have been the objects of many studies. In its full gen- 
erality, one assumes that some number k of servers are available in some 
metric space. Then a sequence of requests must be treated. A request is 
simply a point in the metric space, and a fc-server algorithm must move 
servers in response to the request to ensure that at least one server is placed 
on the request point. A cost is associated with any move of a server (this 
is usually the distance moved in the given metric space), and the objective 
is to minimize total cost. The initial configuration (location of servers) may 
or may not be a part of the problem formulation. 

In investigating the strengths and weaknesses of the various measures for 
the quality of online algorithms, we define the simplest possible nontrivial 
server problem: 

Definition 1 The baby server problem is a 2-server problem on the line with 
three possible request points A, B, and C, in that order from left to right, 
with distance one between A and B and distance d > 1 between B and C. 
The cost of moving a server is defined to be the distance it is moved. We 
assume that initially the two servers are placed on A and C. □ 

All results in the paper pertain to this problem. 
2.2 Server Algorithms 

First, we define some relevant properties of server algorithms: 
Definition 2 A server algorithm is called 

• noncrossing if servers never change their relative position on the line. 

• lazy [13j if it never moves more than one server in response to a request 
and it does not move any servers if the requested point is already 
occupied by a server. 

A server algorithm fulfilling both these properties is called compliant. □ 

Given an algorithm, A, we define the algorithm lazy A, £A, as follows: 
jCAwill maintain a virtual set of servers and their locations as well as the 
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real set of servers in the metric space. There is a one-to-one correspondence 
of real servers and virtual servers. The virtual set will simulate the behavior 
of A. The initial server positions of the virtual and real servers are the 
same. Whenever a virtual server reaches a request point, the corresponding 
real server is also moved to that point (unless both virtual servers reach 
the point simultaneously, in which case only the physically closest is moved 
there). Also, if a request is made and there is a virtual server but no real 
server already on the requested point then the real server that corresponds 
to the virtual server already on the request point is moved to serve the 
request. Otherwise the virtual servers do not move. 

Proposition 1 For any 2-server algorithm, there exists a noncrossing algo- 
rithm with the same cost on all sequences. 

Proof Observation made in [6]. □ 
The following was also observed in |13j : 

Proposition 2 For an algorithm A, let the lazy version of it be CA. For 

any sequence I of requests, A(J) > CA(I). 

We investigate the relative strength of a number of algorithms for the baby 
server problem: 

We define a number of algorithms by defining their behavior on the next 
request point, p. For all algorithms, no moves are made if a server already 
occupies the request point (though internal state changes are sometimes 
made in such a situation). 

Greedy moves the closest server to p. Note that due to the problem for- 
mulation, ties cannot occur (and the server on C is never moved). 

If p is in between the two servers, Double Coverage (Dc), moves both servers 
at the same speed in the direction of p until at least one server reaches the 
point. If p is on the same side of both servers, the nearest server moves to 
p. 

We define a-Dc to work in the same way as Dc, except that the right-most 
server moves at a speed a < d times faster than the left-most server. 

We refer to the lazy version of Dc as Ldc and the lazy version of a-Dc as 
a-LDC. 
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The balance algorithm [13], Bal, makes its decisions based on the total 
distance travelled by each server. For each server, s, let d s denote the total 
distance travelled by s from the initiation of the algorithm up to the current 
point in time. On a request, Bal moves a server, aiming to obtain the 
smallest possible max s d s value after the move. In case of a tie, Bal moves 
the server which must move the furthest. 

If p is in between the two servers, Dummy moves the server that is furthest 
away to the request point. If p is on the same side of both servers, the nearest 
server moves to p. Again, due to the problem formulation, ties cannot occur 
(and the server on A is never moved). 

2.3 Quality Measures 

In analyzing algorithms for the baby server problem, we consider input se- 
quences I of request points. An algorithm, A, which treats such a sequence 
has some cost, which is the total distance moved by the two server. This 
cost is denoted by A(J). Since / is of finite length, it is clear that there 
exists an offline algorithm with minimal cost. By Opt, we refer to such an 
algorithm and Opt(J) denotes the unique minimal cost of processing /. 

All of the measures described below can lead to a conclusion as to which 
algorithm of two is better. In contrast to the others, Bijective Analysis 
does not indicate how much better the one algorithm might be; it does not 
produce a ratio, as the others do. 

2.3.1 Competitive Analysis 

In Competitive Analysis [9j WR I10j . we define an algorithm A to be c- 
competitive if there exists a constant a such that for all input sequences 
/, A(J) < cOpt(J) + a. 

2.3.2 The Max/Max Ratio 

The Max/Max Ratio [2] compares an algorithm's worst cost for any sequence 
of length n to Opt's worst cost for any sequence of length n. The Max/Max 
Ratio of an algorithm A, wm(^), is M(A)/M(Opt), where 

M(A) = limsupmaxA(/)/t. 

t^oo \I\=t 
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2.3.3 The Random Order Ratio 



Kenyon [TT] defines the Random Order Ratio to be the worst ratio obtained 
over all sequences, comparing the expected value of an algorithm, A, over 
all permutations of a given sequence to the expected value of Opt over all 
permutations of the given sequence: 

r E* [A(a(/))] 

hm sup — — — tjz — 

OPT(/)-»ac ° PT ( J ) 

The original context for this definition is Bin Packing for which the optimal 
packing is the same, regardless of the order in which the items are presented. 
Therefore, it does not make sense to take an average over all permutations 
for Opt. For server problems, however, the order of requests in the sequence 
may very well change the cost of Opt. We choose to generalize the Random 
Order Ratio as follows: 

OPT^oo K [OPT(J)] 

If, on the other hand, one chooses the following generalization, the results 
presented here are the same: 



lim sup E a 

OPT(/)^oo 



Opt(<t(J)) 



2.3.4 Bijective Analysis 

In p], Bijective and Average Analysis are defined, as methods of comparing 
two online algorithms directly. As with the Max/Max Ratio and Relative 
Worst Order Analysis, the two algorithms are not necessarily compared on 
the same sequence. Here, we adapt those definition to the notation used 
here. 

In Bijective Analysis, the sequences of a given length are mapped, using 
a bijection onto the same set of sequences. The performance of the first 
algorithm on a sequence, J, is compared to the performance of the second 
algorithm on the sequence I is mapped to. If I n denotes the set of all 
input sequences of length n, then an online algorithm A is no worse than an 
online algorithm IB according to Bijective Analysis if there exists an integer 
no > 1 such that for each n > no, there is a bijection / : I n —> I n satisfying 
A(J) < 1 (/(/)) for each I € I n . 
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Average Analysis can be viewed as a relaxation of Bijective Analysis. An 
online algorithm A is no worse than an online algorithm B according to 
Average Analysis if there exists an integer uq > 1 such that for each n > no , 
£ JeJn A(J) < E/ e j„B(/). 

2.3.5 Relative Worst Order Analysis 

Relative Worst Order Analysis was introduced in [1] and extended in [5]. 
It compares two online algorithms directly. As with the Max/Max Ratio, 
it compares two algorithms on their worst sequence in the same part of a 
partition. The partition is based on the Random Order Ratio, so that the 
algorithms are compared on sequences having the same content, but possibly 
in different orders. 

Definition 3 Let I be any input sequence, and let n be the length of I. If 
a is a permutation on n elements, then a{T) denotes / permuted by a. Let 
A be any algorithm. Then, A(J) is the cost of running A on /, and 

Aw (I) = max A(<r(/)). 

a 

□ 

Definition 4 For any pair of algorithms A and B, we define 

ci(A,B) = sup{c | 3b: V/: A w (-0 > cB w (J) - b} and 
c u (A,B) = inf {c | 3b: V/: A W (I) < cM w (I) + b} . 

If q(A, B) > 1 or c u (A, B) < 1, the algorithms are said to be comparable 
and the Relative Worst-Order Ratio WRa,b of algorithm A to algorithm B 
is defined. Otherwise, WRa,b is undefined. 

If q(A,B) > 1, then WR a ,b = c u (A,B), and 
if c u (A,B) < 1, then WR a ,b = q(A,B) . 

If WRa,b < 1 , algorithms A and B are said to be comparable in A 's favor. 
Similarly, if WRa,b > L the algorithms are said to be comparable in M's 
favor. □ 

Definition 5 Let c u be defined as in Definitional If at least one of the ratios 
c u (A, B) and c u (B, A) is finite, the algorithms A and B are (c u (A, B), c u (B, A))- 
related. □ 



S 



Measure 


Value 


Competitive Ratio 


A(J) 

CR A = max— — — 
i Opt(J) 


Max/Max Ratio 


MR - maX l-T|=n A ( J ) 
max|//| = „ Opt(J') 


Random Order Ratio 


E a [A(a(I))] 

RRa = max 

J Opt(I) 


Relative Worst-Order Ratio 


max CT A(cj(/)) 

Wi?A,B = max 

I max CT / E(a'(l)) 



Table 1: Comparison of measures 

Definition 6 Let c u (A, B) be defined as in Definition [5j Algorithms A and 

B are weakly comparable in A 's favor, 

• if A and B are comparable in A's favor, 

• if c u (A,B) is finite and c u (B,A) is infinite, or 

• if c u (A,B) G o(c u (B,A)). 

□ 



3 Competitive Analysis 

The /c-server problem has been studied using Competitive Analysis starting 
in [T2]. In [6], it is shown that the competitive ratios of Dc and Ldc are k, 
which is optimal, and that Greedy is not competitive. 

4 The Max/Max Ratio 

In [2], a concrete example is given with two servers and three non-colinear 
points. It is observed that the Max/Max Ratio favors the greedy algorithm 
over the balance algorithm, Bal. 
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Bal behaves similarly to Ldc and identically on Ldc's worst case sequences. 
The following theorem shows that the same conclusion is reached when the 
three points are on the line. 



Theorem 1 Greedy is better than Ldc on the baby server problem with 
regards to the Max/max Ratio. 



Proof Given a sequence of length n, Greedy's maximum cost is n, so 
M(Greedy) = 1. 

Opt's cost is at most n (since it is at least as good as Greedy). Opt's 
maximal cost over all sequences of length n = k(2([d\ + 1) + 1) , for some k, 
is at least 2( 2 [lj rf +t)+i n ' smce cost on ((AB)^ +1 C) h is this much. Thus, 
we can bound M(Opt) by ^JjjL < 2( 2 ffjf)+i ^ ^(Opt) < 1. 
We now prove a lower bound on M(Ldc). 

Consider the sequence s = (BABA...BC) P X, where the length of the alter- 
nating A/B-sequence before the C is 2 [d\ + 1, X is a possibly empty alter- 
nating sequence of As and Bs starting with a B, \X\ = n mod (2 [d\ + 2), 

If \X\ ^ 2 [d] + 1, then Ldc(s) = p(2 L^J + 2d) + \X\ = n + (d ~^~} Xn ■ 

Otherwise, Ldc(s) = p(2 [d\ + 2d) + \X\ = n + {d 'fj^ Xl) + d - 1. 

Since we are taking the supremum, we restrict our attention to sequences 

(d-l)ra 

where \X\ = 0. Thus, M(Ldc) > = 1 + ^ > 1 + = 



Finally, 



while 



M(Greedy) 1 
wm (Greedy) = — ^— — — -— < 



M(Opt) - M(Opt) 

/T , M(Ldc) 2d 
wm(Ldc) = — j— — - > 



M(Opt) ~ (2d+ I)M(Opt) 

Since M(Opt) is bounded, — w ?/i}?^l^„ > -jtti which is greater than 1 for 
d > 1. □ 



Observe from the proof that Greedy is close to optimal with respect to the 
Max/max Ratio, since the cost of Greedy divided by the cost of Opt tends 
toward one for large d. 
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Since Ldc and Dc perform identically on their worst sequences of any given 
length, they also have the same Max/Max Ratio. 



5 The Random Order Ratio 

Theorem 2 Ldc is better than Greedy on the baby server problem with 
regards to the Random Order Ratio. 

Proof The Random Order Ratio is the worst ratio obtained over all se- 
quences, comparing the expected value of an algorithm over all permutations 
of a given sequence to the expected value of Opt over all permutations of 
the given sequence. 

Since the competitive ratio of Ldc is two, on any given sequence, Ldc's cost 
is bounded by two times the cost of Opt on that sequence, plus an additive 
constant. Thus, the Random Order Ratio is also at most two. 

Consider all permutations of the sequence We consider positions 

from 1 through n in these sequences. Refer to a maximal subsequence con- 
sisting entirely of either As or Bs as a run. 

Given a sequence containing h As and t Bs, the expected number of runs is 
1 + (A problem in [8] gives that the expected number of runs of As is 

fttjP , so the expected number of runs of Bs is f ^iffi ■ Adding these gives 
the result.) Thus, with h = t = ^, we get § + 1 expected number of runs. 

The cost of Greedy is equal to the number of runs if the first run is a run 
of Bs. Otherwise, the cost is one smaller. Thus, Greedy's expected cost 
on a permutation of s is § + ^ • 

The cost of Opt for any permutation of s is d, since it simply moves the 
server from C to B on the first request to B and has no other cost after 
that. 

Thus, the Random Order Ratio is which, as n tends to infinity, is 

unbounded. □ 



6 Bijective Analysis 

Bijective analysis correctly distinguishes between Dc and Ldc, indicating 
that the latter is the better algorithm. This follows from the following 
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general theorem about lazy algorithms, and the fact that there are some 
sequences where one of Dc's servers repeatedly moves from C towards B, 
but moves back to C before ever reaching B, while Ldc's server stays on C. 



Theorem 3 The lazy version of any algorithm for the baby server problem 
is at least as good as the original algorithm according to Bijective Analysis. 

Proof By Proposition [21 the identity function, id, is a bijection such that 
CA(I) < A(id(I)) for all sequences I. □ 

Theorem 4 Greedy is better than any lazy algorithm Lazy (including 
Ldc) for the baby server problem according to Bijective Analysis. 

Proof Since Greedy has cost zero for the sequences consisting of only the 
point A or only the point C and cost one for the point B, it is easy to define a 
bijection / for sequences of length one, such that Greedy(J) < Lazy(/(/)). 
Suppose that for all sequences of length k that we have a bijection, /, from 
Greedy's sequences to Lazy's sequences, such that for each sequence / of 
length k, Greedy(I) < Lazy(/(I)). To extend this to length fc+1, consider 
the three sequences formed from a sequence I of length k by adding one of 
the three requests A, B, or C to the end of I, and the three sequences formed 
from f(I) by adding each of these points to the end of /(/). At the end of 
sequence /, Greedy has its two servers on different points, so two of these 
new sequences have the same cost for Greedy as on / and one has cost 
exactly 1 more. Similarly, Lazy has its two servers on different points at 
the end of /(/), so two of these new sequences have the same cost for Lazy 
as on /(/) and one has cost either 1 or d more. This immediately defines a 
bijection /' for sequences of length fc + 1 where Greedy(I) < Lazy(/'(J)) 
for all I of length fc + 1. □ 

Corollary 1 Greedy is the unique optimal algorithm. 

Proof Note that the proof of Theorem 0] shows that Greedy is strictly 
better than any lazy algorithm which ever moves the server away from C, so 
it is better than any other lazy algorithm. By Theorem [31 it is better than 
any algorithm. □ 

Observation 1 If an algorithm is better than another algorithm with re- 
gards to Bijective Analysis, then it is also better with regards to Average 
Analysis pp. □ 
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Corollary 2 Greedy is the unique optimal algorithm with respect to Av- 
erage Analysis. 

Proof Follows directly from Corollary [1] and Observation [TJ □ 

Clearly by definition, the algorithm Dummy permanently leaves a server on 
A. 

Theorem 5 Dummy is the unique worst algorithm among compliant server 
algorithms for the baby server problem according to Bijective Analysis. 

Proof Similar to the proof of Theorem 01 but now with cost d for every 
actual move. □ 

Lemma 1 If a < b, then there exists a bijection a n : {A, B, C} n — > {^4, B, C} 
such that a-LDC(J) < 6-LDC(cr n (J)) for all sequences I G {A, B,C} n . 

Proof We use the bijection from the proof of Theorem showing that 
Greedy is the unique best algorithm, but specify the bijection completely, 
as opposed to allowing some freedom in deciding the mapping in the cases 
where we are extending by a request where the algorithms already have 
a server. Suppose that a n is already defined. Consider a sequence I n of 
length n and the three possible ways, I n A, I n B and I n C, of extending it to 
length n + 1. Suppose that o-Ldc has servers on points X a , Y a G {^4, B, C} 
after handling the sequence I n , and 6-Ldc has servers on points Xh,Y b G 
{^4, B, C} after handling a n (I n ). Let Z a be the point where a-LDC does not 
have a server and Z b the point where 6-Ldc does not. Then a n +i(I n Z a ) 
is defined to be a n (I n )Zb. In addition, since the algorithms are lazy, both 
algorithms have their servers on two different points of the three possible, 
so there must be at least one point P where both algorithms have a server. 
Thus, P G {X a ,Y a } n {X b ,Y b }. Let U a be the point in {X a ,Y a } \ {P} and 
Ub be the point in {X^, Y^} \ {P}. Then, a n+ i(I n P) is defined to be a n (I n )P 
and a n+ i(I n U a ) to be a n (I n )U b . 

Consider running a-LDC on a sequence I n and 6-Ldc on a n (I n ) simultane- 
ously. The sequences are clearly constructed so that, at any point during 
this simultaneous execution, both algorithms have servers moving or neither 
does. 

Clearly, the result follows if we can show that 6-Ldc moves away from 
and back to C at least as often as a-LDC does. By construction, the two 
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sequences, I n and a n (I n ), will be identical up to the point where 6-Ldc (and 
possibly a-LDC) moves away from C for the first time. In the remaining part 
of the proof, we argue that if a-LDC moves away from and back to C, then 
6-Ldc will also do so before a-LDC can do it again. Thus, the total cost of 
6-Ldc will be at least that of a-LDC. 

Consider a request causing the slower algorithm, a-LDC, to move a server 
away from C. 

If 6-Ldc also moves a server away from C at this point, both algorithms 
have their servers on A and B, and the two sequences continue identically 
until the faster algorithm again moves a server away from C (before or at 
the same time as the slower algorithm does). 

If 6-Ldc does not move a server away from C at this point, since, by con- 
struction, it does make a move, it moves a server from A to B. Thus, the 
next time both algorithms move a server, a-LDC moves from B to C and 
6-Ldc moves from B to A. Then both algorithms have servers on A and C. 
Since a-LDC has just moved a server to C, whereas 6-Ldc must have made 
at least one move from A to B since it placed a server at C, 6-Ldc must, 
as the faster algorithm, make its next move away from C strictly before 
a-LDC does so. In conclusion, the sequences will be identical until the faster 
algorithm, 6-Ldc, moves a server away from C. □ 

Theorem 6 According to Bijective Analysis and Average Analysis, slower 
variants of Ldc are better than faster variants for the baby server problem. 

Proof Follows immediately from Lemma [T] and the definition of the mea- 
sures. □ 

Thus, the closer a variant of Ldc is to Greedy, the better Bijective and 
Average Analysis predict that it is. 

7 Relative Worst Order Analysis 

Let Ja denote a worst ordering of the sequence / for the algorithm A. 

Similarly to bijective analysis, relative worst order analysis correctly dis- 
tinguishes between Dc and Ldc, indicating that the latter is the better 
algorithm. This follows from the following general theorem about lazy al- 
gorithms, and the fact that there are some sequences where one of Dc's 
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servers repeatedly moves from C towards B, but moves back to C before 
ever reaching B, while Ldc's server stays on C. If d is just marginally larger 
than some integer, even on Ldc's worst ordering of this sequence, it does 
better than Dc. 

Theorem 7 The lazy version of any algorithm for the baby server problem 
is at least as good as the original algorithm according to Relative Worst 
Order Analysis. 

Proof By Proposition^ for any request sequence /, CA{l£pJ < A(/£^) < 
A(/ A ). " □ 

Theorem 8 Greedy and Ldc are (oo, 2)-related and are thus weakly com- 
parable in Ldc's favor for the baby server problem according to Relative 
Worst Order Analysis. 

Proof First we show that c u (Greedy, Ldc) is unbounded. Consider the se- 
quence (BA)z . As n tends to infinity, Greedy's cost is unbounded, whereas 
Ldc's cost is at most 3d for any permutation. 

Next we turn to c u (Ldc, Greedy). Since the competitive ratio of Ldc 
is 2, for any sequence /, there exists a constant b such that Ldc(/l dc ) < 
2Greedy(Il dc )+6 < 2Greedy(7q reedy )+6. Thus, c u (Ldc, Greedy) < 
2. 

For the lower bound of 2, consider the family of sequences I p = (BABA...BC) P , 
where the length of the alternating A /l?-sequence before the C is 2 [d\ + 1 . 

LDC(/p) =p(2 [d\ +2d). 

A worst ordering for Greedy alternates ^4s and Bs. Since there is no cost 
for the Cs and the A/B sequences start and end with Bs, Greedy(o"(/ p )) > 
p(2 [_d\ ) + 1 for any permutation a. 

Then, c u (Ldc, Greedy) > ffjff? > ^gjr As p goes to infinity, this 
approaches 2. 

Thus, Greedy and Ldc are weakly comparable in Ldc's favor. □ 

Recalling the definition of o-Ldc, a request for B is served by the right-most 
server if it is within a virtual distance of no more than a from B. Thus, when 
the left-most server moves and its virtual move is over a distance of I, then 
the right-most server virtually moves a distance al. When the right-most 



15 



server moves and its virtual move is over a distance of al, then the left-most 
server virtually moves a distance of I. 

In the results that follow, we frequently look at the worst ordering of an 
arbitrary sequence. 

Definition 7 The canonical worst ordering of a sequence, /, for algorithm 
is the sequence produced by allowing the cruel adversary (the one which 
always lets the next request be the unique point where A does not currently 
have a server) to choose requests from the multiset defined from /. This 
process continues until there are no requests remaining in the multiset for 
the point where does not have a server. The remaining points from the 
multiset are concatenated to the end of this new request sequence in any 
order. □ 

It is easy to see what the canonical worst ordering of a sequence for a-LDC 
is. 

Proposition 3 Consider an arbitrary sequence I containing ua As, ub Bs, 
and nc Cs. A canonical worst ordering of / for a-LDC is I a = (BABA...BC) Pa X, 
where the length of the alternating A/iJ-sequence before the C is 2 [£| + 1. 
Here, X is a possibly empty sequence. The first part of X is an alternat- 
ing sequence of ^4s and Bs, starting with a B, until there are not both As 
and Bs left. Then we continue with all remaining As or Bs, followed by all 
remaining Cs. Finally, 



p a = min 



r 


n A 




n B 






ml 


•> 





Lemma 2 Let I a be the canonical worst ordering of / for a-LDC. I a is a 
worst permutation of / for a-LDC, and the cost for a-LDC on I a is c a , where 
p a {2 + 2d) <c a < Pa (2 LfJ + 2d) + 2 L^J + d. 

Proof Consider a request sequence, I. Between any two moves from B to 
C , there must have been a move from C to B. Consider one such move. 
Between the last request to C and this move, the other server must move 
from A to B [^J +1 times, which requires some first request to B in this 
subsequence, followed by at least occurrences of requests to A, each 
followed by a request to B. (Clearly, extra requests to A or B could also 
occur, either causing moves or not.) Thus, for every move from B to C, 
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there must be at least [|J + 1 Bs, |_^J ^4s and one C. Thus, the number of 
moves from B to C is bounded from above by p a . There can be at most one 
more move from C to B than from B to C. If such a move occurs, there 
are no more Cs after that in the sequence. Therefore, the sequences defined 
above give the maximal number of moves of distance d possible. More As or 
Bs in any alternating A/B-sequence would not cause additional moves (of 
either distance 1 or d), since each extra point requested would already have 
a server. Fewer ^4s or Bs between two Cs would eliminate the move away 
from C before it was requested again. Thus, the canonical worst ordering is 
a worst ordering of /. 

Within each of the p a repetitions of (BABA...BC), each of the requests for 
A and all but the last request for B cause a move of distance 1, and the last 
two requests each cause a move of distance d, giving the lower bound on 
c a . Within X, each of the first 2 [^J requests could possibly cause a move 
of distance 1, and this could be followed by a move of distance d. After 
that, no more moves occur. Thus, adding costs to the lower bound gives the 
upper bound on c a . □ 

\<k\+d ( \ i \+d) \ - I 

Theorem 9 If a < b, then o-Ldc and 6-Ldc are ( HH , \ d \ ,[ \ d \ )- 
related for the baby server problem according to Relative Worst Order Anal- 
ysis. 



Proof By Lemma [2j in considering a-LDC's performance in comparison 
with 6-Ldc's, the asymptotic ratio depends only on the values p a and pb 
defined for the canonical worst orderings I a and for a-LDCand 6-Ldc, 
respectively. Since a > b, the largest value of |^ occurs when p a = nc, 
since more Cs would allow more moves of distance d by 6-Ldc. Since the 
contribution of X to o-Ldc's cost can be considered to be a constant, we 
may assume that ua = nc |_^J and «b = ^c(|_^J + 1)- 

When considering 6-Ldc's canonical worst ordering of this sequence, all the 
Cs will be used in the initial part. By Lemma [21 we obtain the following 
ratio, for some constant c: 

(2^j+2rf)n c 
(2 + 2d)n c + c 

Similarly, a sequence giving the largest value of — will have pb = t¥t, since 

Pa Lb J 

more ^4s would allow a-LDC to have a larger p a . Since the contribution 
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of X to 6-Ldc can be considered to be a constant, we may assume that 
nA = n c [f\ , n B = n c ( [f J + 1), and p b = n C - 

Now, when considering a-LDC's worst permutation of this sequence, the 
number of periods, p a , is restricted by the number of ^4s. Since each period 

After this, there are a constant number 



has UJ As, p 



TIT 



of As remaining, giving rise to a constant additional cost d . 
Thus, the ratio is the following: 

C2 Lf J + 2 d)n c 



(2 H\ + 2d) 



Lf 

Li 



Considering the above ratios asymptotically as nc goes to infinity, we obtain 



that a-LDC and a-LDC are ( 



+d ( 



+d' ( 



)-related. 



□ 



We provide strong indication that Ldc is better than 6-Ldc for 6^1. If 
6 > 1, this is always the case, whereas if b < 1, it holds in many cases, 
including all integer values of d. 

Theorem 10 Consider the baby server problem evaluated according to Rel- 
ative Worst Order Analysis. For b > 1, if Ldc and 6-Ldc behave differently, 
then they are (r, r&)-related, where 1 < r < r&. If a < 1, a-LDC and Ldc 
behave differently, and d is a positive integer, then they are (r a , r)-related, 
where 1 < r a < r. 



Proof By Theorem [9l a-LDC and 6-Ldc are 



d 
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+d ( 


d 
h 


+d) 


d 
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d 
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+d' ( 
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+d) 


d 
h 



-related. 



Subtracting these two values and comparing to zero, we get 

< 



d 

a 


+d ( 


d 
b 


+d) 


d 
. a 


d 

b _ 


+d ( 


d 
_ a _ 


+d) 


d 



t 

e = ([i\+dm+d)[i\-([i\+d)([i\+d)[i\<o 

We can rewrite E = ([|J - Lf J ) ( L#J LfJ " 

Observe that the first term is positive when b > a and the algorithms behave 
differently. 
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We now analyze Ldc compared with variants of other speeds. First, assume 
that a = 1 and b > 1. 

Since the algorithms are different, [f J < |JJ - 1. Thus, the second factor, 
La J Lf J ~ d 2 , is bounded from above by d(d - 1) - d 2 < 0. Hence, if Ldc 
and 6-Ldc are (c±, C2)-related, then c\ < ci- 

Now, assume that a < 1 and 6=1. 

Again, the algorithms are different, so [f J < [£| ~ L 

Note that if |_^J [d\ > d 2 , then a-LDC and Ldc are (ci, C2)-related, where 
C2 > c±. This holds when d is a positive integer. □ 

There is an indication that a-LDC and ^-Ldc are in some sense of equal 
quality: 

Corollary 3 When ^ and ^ are integers, then a-LDC and 6-Ldc are (b, 6)- 
related when b = -. □ 

a 

We now set out to prove that Ldc is an optimal algorithm in the following 
sense: there is no other algorithm A such that Ldc and A are comparable 
and A is strictly better or such that Ldc and A are weakly comparable in 
A's favor. 

Theorem 11 Ldc is optimal for the baby server problem according to Rel- 
ative Worst Order Analysis. 

Proof Since the competitive ratio of Ldc is 2, for any algorithm A and any 
sequence /, there is a constant b such that Ldc(/l dc ) ^ 2^(^Ldc) + b < 
2A(7a) + b. Thus, c u (Ldc, A) < 2, so A is not weakly comparable to Ldc 
in A's favor. 

For A to be a better algorithm than Ldc according to Relative Worst Order 
Analysis, it has to be comparable to Ldc and perform more than an addi- 
tive constant better on some sequences (which then necessarily must be an 
infinite set). 

Assume that there exists a family of sequences S\ , S*2 , . . . such that for any 
constant c there exists an i such that LDCw{Si) > Aw {Si) + c. 

Then we prove that there exists another family of sequences S[, S' 2 , . . . such 
that for any constant c there exists an i such that A^(S , t / ) > LDCw(S^) + c. 
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This establishes that if A performs more than a constant better on its worst 
permutations of some family of sequences than Ldc does on its worst per- 
mutations, then there exists a family where Ldc has a similar advantage 
over A which implies that the algorithms are not comparable. 

Now assume that we are given a constant c. Since we must find a value 
greater than any c to establish the result, we may assume without loss of 
generality that c is large enoug h that 3dc > fjj&ildj + 2d) + 3d. 

Consider a sequence S from the family Si, 5*2, . . . such that LdCw(S) > 
Ayy(S) + 3dc. From S we create a member S' of the family S[, S' 2 , . . . such 
that A W (S') > LDCw(S') + c. 

Assume that S contains ha As, ub Bs, and tlq Cs. 

The idea behind the construction is to allow the cruel adversary to choose 
points from the multiset defined from S as in the definition of canonical 
worst orderings. This process continues until the cruel adversary has used 
all of either the As, Bs, or Cs in the multiset, resulting in a sequence S'. If 
the remaining points from the multiset are concatenated to S' in any order, 
this creates a permutation of S. The performance of A on this permutation 
must be at least as good as its performance on its worst ordering of S. 

We now consider the performance of Ldc and A on S' and show that Ldc 
is strictly better. 

Let n' A , n' B , and n' c denote the number of As, Bs, and Cs in S', respectively. 



By Lemma [2j the cost of Ldc on its canonical worst ordering of S' is at 
most p{2 [dj + 2d) + 3d. 

The cost of A is 2dn' c + n' A + n' B — n' c , since every time there is a request 
for C, this is because a server in the step before moved away from C. These 
two moves combined have a cost of 2d. Every request to an A or a B has 
cost one, except for the request to B immediately followed by a request to 
C, which has already been counted in the 2dn' c term. A similar argument 
shows that Ldc's cost is bounded from above by the same term. 

Assume first that j& = pj+i = n 'c- Then S' can be permuted so that it is 
a prefix of Ldc's canonical worst ordering on S (see Lemma [2] with a = 1). 
Since, by construction, we have run out of either As, Bs, or Cs (that is, 
one type is missing from S minus S' as multisets), Ldc's cost on its worst 
ordering of S is at most its cost on its worst ordering on S' plus 3d. Thus, 
LDCw (Si) > Aw(Si) + c does not hold in this case, so we may assume that 
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these values are not all equal. 
Case n' A > [d\p: 

Ldc's cost on <S" is at most the cost of A minus (n' A — |_dj p) plus 3d. 
Case n' B > ([d\ + l)p: 

Ldc's cost on S' is at most the cost of A minus (n' B — ([d\ + l)p) plus 3d. 
Case n' c > p: 

Ldc's cost on S' is at most the cost of A minus (2d — l)(n' c — p) plus 1. 

We compare Ldc's canonical worst orderings of S and S' . In both cases, the 
form is as in LemmaEl with a = 1. Thus, for S' the form is (BA)^BC) P X, 
and for S, it is ((B A)^ BC) p+l Y for some positive integer I. The string X 
must contain all of the As, all of the Bs and/or all of the Cs contained in 
((B A)L d J BC) 1 , since after this the cruel adversary has run out of something. 
Thus, it must contain at least I \_d\ As, l([d\ + 1) Bs or / Cs. The extra cost 
that Ldc has over A on S is at most its cost on ((BA)^ d ^ BC) l Y minus cost 
I [d\ for the ^4s, -Bs or Cs contained in X, so at most 1(2 \_d\ +2d)—l [d\ +3d = 
l([d\ +2d) + 3d. 

Thus, Ldcw(S) - A W (S) < l([d\ + 2d) + 3d while A W (S') - Lnc w (S') > 
/ [d\ - 3d. 

From the choice of c and the bound just derived, we find that 

3d , + 1 (Id] +2d) +3d < 3dc < Ldc w (S) - A W (S) < l(\d\ + 2d) + 3d 
[dj-1 

Thus, I > nfnzi ' w hich implies the following: 



I > 



3d+l 



Ldj-i 
t 

l([d\-l) > 3d+l 

t 

I [d\ - 3d > I + 1 

t 

i [d ]-3d > mgHgfj 



Now, 



4 / /-y/N T , „, v ,,„ , I (\ d \ + 2d) + 3d 3dc 
A W (S') - Ldc w (S') >l[d\-3d> VL J M > > — = c. 



□ 
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The proof of optimality only depends on the form of the canonical worst 
orderings for the optimal algorithm and the costs associated with these. 
More specifically, the canonical worst orderings produced using the cruel 
adversary are worst orders, and these sequences have the form R ■ Q\ ■ 
Q2 ■ ■ ■ Qk'X, where R has constant cost Qi has the form (BA) Vi BC with / < 
t% < I + 1 for some integer I, and X cannot include enough ^4s, Bs and Cs so 
that the sequence could instead have been written R ■ Q\ ■ Q2 ■ ■ ■ Qk • Qk+i • X' 
for some Qk+i having the same form as the others and for some X' . Since the 
cruel adversary is used, the patterns Qi are uniquely defined by the algorithm 
and every request in each of them has a cost. In the above proof, all the Qi 
were identical. As an example of where they are not all identical, consider 
d = 2|, where the canonical worst ordering for Bal has the following prefix: 

B ABC (B AB AB ABC B AB ABC)* 

Note that this implies that a-LDC and Bal are also optimal algorithms. 

In the definitions of Ldc and Bal given in Section [2J different decisions are 
made as to which server to use in cases of ties. In Ldc the server which is 
physically closer is moved in the case of a tie (equal virtual distances from 
the point requested). The rationale behind this is that the server which 
would have the least cost is moved. In Bal the server which is further away 
is moved to the point. The rationale behind this is that, since d > 1, when 
there is a tie, the total cost for the closer server is already significantly higher 
than the total cost for the other, so moving the server which is further away 
evens out how much total cost they have, at least temporarily. With these 
tie-breaking decisions, the two algorithms behave very similarly when d is 
an integer. 

Theorem 12 Ldc and Bal are not comparable on the baby server problem 
with respect to Relative Worst Order Analysis, except when d is an integer, 
in which case they are equivalent. 

Proof One can see that a canonical worst ordering of a sequence I for Bal 
has the following prefix: R ■ Q\ ■ Q2 ■ ■ ■ Qk, where for all i, Qi = (BA) Ti BC, 
where [d\ < rj < [~ef| and R contains at most two Cs. 

Suppose that d is an integer. Consider any request sequence /. Ldc's 
canonical worst ordering has the prefix {{BA) d BC) k , while Bal's canonical 
worst ordering has the prefix BABC(BA) ^f^ 1 BC{(BA) d BC) k ' . These 
prefixes of Ldc's and Bal's canonical worst orderings of / are identical, 
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except that R is empty for Ldc, while Bal has R = BABC{BA)^~^~ 1 BC. 
This leads to some small difference at the end, where Bal might have at most 
two fewer of the (BA) d BC repetitions than Ldc. Thus, their performance on 
their respective worst orderings will be identical up to an additive constant 
due to the differences at the start and end of the prefixes. 

Now consider the case where d is not an integer. Following the proof of The- 
orem [Til consider a sequence, /, where the canonical worst ordering for Bal 
consists entirely of the prefix form described above. Since d is not an inte- 
ger, not all of the QiS will have the same length. The difference from Ldc's 
canonical worst ordering is that there are some Qi of the form (BA)^BC, 
so in their respective prefixes, Bal has more ^4s and Bs, compared to the 
number of Cs, than Ldc does. This means that Ldc will have a lower cost 
on its worst ordering. However, the sequence, produced by letting the 
cruel adversary for Ldc choose points from the multiset defined by this se- 
quence, /, has all Qi = (BA)^BC. Bal's canonical worst ordering of I' 
will have fewer QiS because there will not be enough As to make the longer 
repetitions required. Therefore, the two algorithms are incomparable when 
d is not an integer. □ 



8 Open Questions 

It appears that both Bijective Analysis and Relative Worst Order Analysis 
have the advantage over Competitive Analysis that if one algorithm does at 
least as well as another on every sequence, then they decide in favor of the 
better algorithm. This works for the lazy vs. non-lazy version of the baby 
server problem, and also for some paging problems — LRU vs. FWF and 
look-ahead vs. no look-ahead. See [7] and [5] with regards to these results 
on Bijective Analysis and Relative Worst Order Analysis, respectively. 

However, Bijective Analysis does not reflect the fact that when comparing 
on the same sequence, even though Greedy can be a factor two better than 
Lazy Double Coverage, LDC is sometimes much better than Greedy for the 
baby server problem. Similarly, according to Bijective Analysis, LIFO and 
LRU are equivalent for paging, but LRU is often significantly better than 
LIFO, which keeps the first k — 1 pages it sees in cache forever. 

In both of these cases, Relative Worst Order Analysis says that the algo- 
rithms are weakly comparable in favor of the "better" algorithm. It would 
be interesting to see if there are problems where Bijective Analysis gives 
better results than Relative Worst Order Analysis. For instance, can local- 
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ity of reference be added to Relative Worst Order Analysis and succeed in 
separating LRU from FIFO, as one can by adding locality of reference to 
Bijective Analysis pQ? 

Is it possible to develop a set of general guidelines to determine which of the 
measures studied here is best used in which context? 

So far there seems to be basically one technique for proving results in Bijec- 
tive Analysis, and it works when you can compare two algorithms and say 
that among all the next possible requests, the one algorithm has at least 
as many low cost requests as the other. This works for demand paging [3], 
since all such algorithms have the same number of pages in the cache and 
the same number currently not in the cache. This works for Greedy vs. 
other algorithms for our problem. And this works for list accessing |7J, since 
each algorithm has exactly one item with each possible cost in the list. It 
seems hard to apply this to bin packing or even to more complicated fc-server 
problems. It would be interesting to come up with a new technique. 

Finally, we have investigated the smallest possible server problem in order 
to evaluate the differences between various measures. We believe, we would 
obtain the same classifications on more general server problems, but gen- 
eralizations of some of the results presented here would require significant 
technical effort. It would also be interesting to prove optimality of some 
algorithm for a more general problem. 
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